Documentation update for 1.19 #597

PatrykWo · 2024-12-05T12:44:45Z

No description provided.

Fixing the following flags setting: cutlass_fp8_supported, use_marlin to False for HPU

Reverts HabanaAI#551 Different formatting

Revert changes in lm eval test

mgawarkiewicz · 2024-12-05T13:00:01Z

README_GAUDI.md

+
+#### 1. Build and Install the stable version
+
+Periodically, we are releasing vLLM to allign with Intel® Gaudi® software releases. The stable version is released with a tagg, and supports fully validated features and performance optimizations in Gaudi's [vLLM-fork](https://github.com/HabanaAI/vllm-fork). To install the stable release from [HabanaAI/vLLM-fork](https://github.com/HabanaAI/vllm-fork), run the following:


michalkuligowski · 2024-12-10T09:29:03Z

README_GAUDI.md

+```{.console}
+$ git clone https://github.com/HabanaAI/vllm-fork.git
+$ cd vllm-fork
+$ git checkout v1.19.0


Probably will need to replace with proper tag like in v0.5.3.post1+Gaudi-1.18.0

@bartekkuncer please verify if that makes sense.

@michalkuligowski makes a good point especially that in release note we provide instruction with the use of a tag, so this change will make these two consistent.

michalkuligowski · 2024-12-10T09:29:31Z

docs/source/getting_started/gaudi-installation.rst

-   $ cd vllm
+   $ git clone https://github.com/HabanaAI/vllm-fork.git
+   $ cd vllm-fork
+   $ git checkout v1.19.0


Probably will need to replace with proper tag like in v0.5.3.post1+Gaudi-1.18.0

@bartekkuncer please verify if that makes sense.

@michalkuligowski makes a good point especially that in release note we provide instruction with the use of a tag, so this change will make these two consistent.

README_GAUDI.md

piotrbocian · 2024-12-19T10:24:59Z

README_GAUDI.md

@@ -11,7 +11,7 @@ Please follow the instructions provided in the [Gaudi Installation Guide](https:
 - OS: Ubuntu 22.04 LTS
 - Python: 3.10
 - Intel Gaudi accelerator
- Intel Gaudi software version 1.18.0
+- Intel Gaudi software version 1.19.0

 ## Quick start using Dockerfile
 ```


It needs more explanation.

Will you add it? @piotrbocian

Or explain what you have in mind so someone else can do it?

Other vendors seem to have it in the similar way as here:
https://docs.vllm.ai/en/latest/getting_started/openvino-installation.html#quick-start-using-dockerfile
https://docs.vllm.ai/en/latest/getting_started/cpu-installation.html#quick-start-using-dockerfile
https://docs.vllm.ai/en/latest/getting_started/arm-installation.html#quick-start-with-dockerfile
https://docs.vllm.ai/en/latest/getting_started/xpu-installation.html#quick-start-using-dockerfile

Please see how document is structured:

Quick start using Dockerfile

Build from source
2.1 Environment verification
2.2 Run Docker Image
2.3 Build and Install vLLM

Questions:

is (1.) full alternative to (2.)? If so, I would add one liner as

"You can quickly set up vLLM using latest Intel Gaudi docker and vllm verson "

is (2.1 Env verification) common to (1.) and (2.)?

README_GAUDI.md

docs/source/getting_started/gaudi-installation.rst

Co-authored-by: Piotr Bocian <[email protected]>

Merges #507 and #597, updates changelog and adds minor changes. --------- Co-authored-by: Bartosz Kuncer <[email protected]>

michalkuligowski and others added 21 commits November 25, 2024 10:18

Update ray_hpu_executor.py (HabanaAI#541)

a9b9e23

fix flags setting on HPU for FP8LinearMethod

79edcf2

Update sh scripts (HabanaAI#546)

cb1ba00

Update cpu-test.yml (HabanaAI#543)

03ef71a

Fix flags setting on HPU for FP8LinearMethod (HabanaAI#551)

087c304

Fixing the following flags setting: cutlass_fp8_supported, use_marlin to False for HPU

Revert "Fix flags setting on HPU for FP8LinearMethod" (HabanaAI#553)

36d872d

Reverts HabanaAI#551 Different formatting

Update run-lm-eval-gsm-vllm-baseline.sh (HabanaAI#554)

e04615d

Revert changes in lm eval test

1.19.0 fast-forward merge (HabanaAI#542)

79e37ad

Update documentation

0b5bf99

Update README_GAUDI.md

58be7bc

Update gaudi-installation.rst

b8136a3

Update compatibility_matrix.rst

e19bd83

Update compatibility_matrix.rst

eb631ef

Update gaudi-installation.rst

3b624ac

Update compatibility_matrix.rst

5f689dd

Update compatibility_matrix.rst

b2532a0

Update Dockerfile.hpu

d8b7ae0

Update README_GAUDI.md

647c19c

Update gaudi-installation.rst

3ecd3c0

Update README_GAUDI.md

213c716

Updates to the documentation

d6bfbaf

mgawarkiewicz reviewed Dec 5, 2024

View reviewed changes

PatrykWo added 4 commits December 5, 2024 16:51

Update README_GAUDI.md v2

ae2a931

Merge branch 'HabanaAI:habana_main' into doc_update

71bc8e9

Update README_GAUDI.md v3

b138eb9

Update gaudi-installation.rst

4f28b61

michalkuligowski requested changes Dec 10, 2024

View reviewed changes